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Abstract 


Managerial decision making is likely to be a dominant determinant of performance of 
teams in team sports. Here we use Japanese and German football data to investigate 
correlates between temporal patterns of formation changes across matches and match 
results. We found that individual teams and managers both showed win-stay lose-shift 
behavior, a type of reinforcement learning. In other words, they tended to stick to the 
current formation after a win and switch to a different formation after a loss. In addition, 
formation changes did not statistically improve the results of succeeding matches. The 
results indicate that a swift implementation of a new formation in the win-stay lose-shift 
manner may not be a successful managerial rule of thumb. 

1 Introduction 

Exploring rules governing decision making has been fascinating various fields of research, 
and its domain of implication ranges from our daily lives to corporate and governmental 
scenes. In economic contexts in a widest sense, individuals often modify their behavior 
based on their past experiences, attempting to enhance the benefit received in the future. 
Such decision making strategies are generally called reinforcement learning. In reinforce¬ 
ment learning, behavior that has led to a large reward will be selected with a larger 
frequency, or the behavior will be incrementally modified toward the rewarded one. Rein¬ 
forcement learning is common in humans DEI and non-humans [3], is implemented with 
various algorithms |4], has theoretical underpinnings HH, and has neural substrates [5116] . 

A simple version of reinforcement learning is the so-called win-stay lose-shift (WSLS) 
strategy W- An agent adopting this strategy sticks to the current behavior if the 
agent is satisfied. The agent changes its behavior if unsatisfied. Experimental studies 
employing human participants have provided a line of evidence in favor of WSLS in 
situations such as repeated Prisoner’s Dilemma mm, gambling tasks mm, and tasks 
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in which participants construct virtual stone tools |T3l - fT5] . It has also been suggested in 
nonscientific contexts that decisions by athletes and gamblers are often consistent with 
WSLS patterns even if the outcome of games seems to be independent of the decision Hi. 

Association football (also known as soccer; hereafter refer to it as football) is one of the 
most popular sports in the world and provides huge business opportunities. The television 
rights of the English Premier League yield over two billion euros per year mt Transfer 
fees of top players can be tens of millions of euros Hi- Various aspects of football, not only 
watching but also betting jT9] and the history of tactics (20], enjoy popularity. Football 
and other team sports also provide data for leadership studies because a large amount of 
sports data is available and the performance of teams and players can be unambiguously 
measured by match results [2T1 [23] . 

In the present study, using data obtained from football matches, we examine the 
possibility that managers of teams use the WSLS strategy. Managers can affect the per¬ 
formance of teams through selections of players, training of players, and implementation 
of tactics including formations [18]. In particular, a formation is a part of tactics to de¬ 
termine how players participate in offense and defense [24] and considered to affect match 
results [241125], Managerial decision making in substituting players during a match may 
affect the probability of winning [25]. We hypothesize that a manager continues to use 
the same formation if he has won the previous match, whereas he experiments on another 
formation following a loss in the previous match. 

The WSLS and more general reinforcement learning posit that unsuccessful individuals 
modify their behavior to increase the probability of winning. Therefore, we are interested 
in whether a formation change improves the performance of a team. To clarify this point, 
we also investigate effects of formation changes on the results of succeeding matches. 
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2 Materials and Methods 


2.1 Data set 

We collected data on football matches from two websites, J-League Data Site (officially, “J. 
League”) [26] on J-League, and Kicker-online j27] on Bundesliga. J-League and Bundesliga 
are the most prestigious professional football leagues in Japan and Germany, respectively. 
We refer to the two data sets as the J-League and the Bundesliga data sets. The two 
data sets contain, for each team and match, the season, date, manager’s name, result 
(i.e., win, draw, or loss), and starting formation. Basic statistics of the data sets are 
summarized in Table Q] The distributions of the probability of winning for teams and 
managers are shown in Fig. 1 for the two data sets. Because the strength of a team 
apart from the manager was considered to affect the probability of winning, in Fig. 1, we 
treated a manager as different data points when he directed different teams. The same 
caveat applies to all the following analysis focusing on individual managers (Figs. 3-6). 

Between 1993 and 2004, except for 1996, each season of J-League was divided into two 
half seasons. After the two half seasons had been completed, two champion teams, each 
representing a half season, played play-off matches. We regarded each half season as a 
season because intervals between two half seasons ranged from ten days to two months 
and therefore are longer than one week, which was a typical interval between two matches 
within a season. We also carried out the same analysis when we regarded one year, not 
one half season, as a season and verified that the main results were unaltered (Appendix 
A). 

We also collected data on Bundesliga from another website, Fussballdaten [28]. We 
focused on the Kicker-online data rather than the Fullballdaten data because the definition 
of the position was coarser for the Fussballdaten data (i.e., a player was not assumed to 
change his position during a season) than the Kicker-online data. Nevertheless, to verify 
the robustness of the following results, we also analyzed the Fussballdaten data (Appendix 
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B). 

2.2 Definition of formation 

The definition of formation was different between the two data sets. In the J-League data, 
each of the ten field players was assigned to either defender (DF), midfielder (MF), or 
forward (FW) in each match. We defined formation as a triplet of the numbers of DF, 
MF, and FW players, which sum up to ten. For example, a formation 4-4-2 implies four 
DFs, four MFs, and two FWs. 

In the Bundesliga data, the starting positions of the players were given on a two- 
dimensional map of the pitch (Fig. [2]). For this data set, we defined formation as follows. 
First, we measured the distance between the goal line and the bottom edge of the image 
representing each player along the vertical axis (e.g., 113 shown in Fig. 2). We referred to 
the HTML source code of Kicker-online to do this. The unit of the distance is pixel (px). 
The distance between the goal line and the half-way line is between 45 and 60 m in real 
fields. The same distance is approximately equal to 500 px in Kicker-online. Therefore, 
1 px in Kicker-online roughly corresponds to 10 cm in real fields. Although the HTML 
source code also included the distance of players from the left touch line, we neglected 
this information because the primary determinant of the player’s position seems to be the 
distance from the goal line rather than that from the left or right touch line, as implied 
by the terms DF, MF, and FW. Second, we grouped players whose distances from the 
goal line were the same. Third, we ordered the groups of players in terms of the distance, 
resulting in an ordered set of the numbers of players at each distance value. The set of 
numbers defined a formation. For example, when the distances of the ten field players are 
equal to 113 px, 113 px, 113 px, 113 px, 236 px, 236 px, 359 px, 359 px, 359 px, and 441 
px, the formation of the team is defined to be 4-2-3-1 (Fig. [2]). 

Among all matches in the Bundesliga data, the smallest nonzero distance between two 
players was equal to 31 px. Therefore, we did not have to worry about the possibility 
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that players possessed almost the same distance values while being classified into distinct 
positions. For example, there was no case in which the distances of two field players from 
the goal line were equal to 113 px and 114 px. 

For both data sets, a formation was defined as an ordered set of numbers, whereas the 
definition differs for the two data sets. For example, forward players possessing distance 
values 359 and 441 were classified into different positions in the Bundesliga data, whereas 
they belonged to the same position in the J-League data if they were both assigned to 
FW. In the following, we regarded that formation was changed when the ordered set of 
numbers differed between two consecutive matches. 

Figures |3(a)-(d) show the distribution of the probability that a team or manager 
has changed the formation in the two leagues. To calculate the probability of formation 
changes for a team, we excluded the first match in each season and the matches imme¬ 
diately after a change in the manager. As in the case of formation changes, we regarded 
that a manager was changed when the manager directed a team in a given match but 
did not do so in the next match. With this definition, a short absence of a chief manager 
due to illness, for example, may induce formation changes. However, we adhered to this 
definition because of the lack of further information on behavior of managers. In addition, 
as explained in section 2.1, we treated a manager as different data points when he led 
different teams. 

The frequency of formation changes as a function of time is shown in Figs. 3(e) and 
3(f) for J-League and Bundesliga, respectively. The figures suggest that the frequency 
of formation change is stable over years in J-League, but not in Bundesliga. Finally, we 
measured burstiness and memory coefficient [29] for interevent times of formation changes 
to quantify temporal patterns of formation changes. The results are shown in Appendix 
C. 
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2.3 GLMM 


To statistically examine whether patterns of formation changes were consistent with WSLS 
behavior, we investigated effects of previous matches and other factors on the likelihood 
of formation change for each team. If managers used the WSLS, the effect of the win and 
loss in the previous match on the likelihood of formation change should be significantly 
negative and positive, respectively. We used a generalized linear mixed model (GLMM) 
with binomial errors and a logit-link function. 

The dependent variable was the occurrence or lack thereof of formation changes, 
which was binary. As independent variables, we included the binary variable represent¬ 
ing whether or not the stadium was the home of the team (i.e., home or away) and the 
ternary result of the previous match (i.e., win, draw, or loss). We designated the draw as 
the reference category for the match result. Because the likelihood of formation changes 
may be affected by a streak of wins or losses, we also included the result of the second 
last match as an independent variable. The difference between the focal team’s strength 
and the opponent’s strength was also an independent variable. The strength of a team 
was defined by the probability of winning in the season. We estimated the strength of 
a team separately for each season because it can vary across seasons. The name of the 
manager was included as a random effect (random intercept). 

In this and the following analysis, we excluded the first match in each season for each 
team because we considered that the result of the last match in the preceding season would 
not directly affect the first match in a new season. In addition, we excluded matches im¬ 
mediately after a change of manager because we were not interested in formation changes 
induced by a change of manager. We further excluded the second match in each season 
for each team from the GLMM analysis when we employed the result of the second last 
match as an independent variable. Because the J-League data set did not have the infor¬ 
mation on managers between 1993 and 1998, we only used data between 1999 and 2014 in 
the GLMM analysis. We performed the statistical analysis using R 3.1.2 [20] with lme4 
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package [31] . 

2.4 Ordered probit model 

We also investigated the effects of formation changes on match results. We used the 
ordered probit. model because a match result was ternary. Because the strength was 
considered to heavily depend on teams, we controlled for the strength of teams. The 
same model was used for fitting match results in football in the Netherlands [32] and the 
UK PS]. 

The dependent variable of the model was a match result. We assumed that the oc¬ 
currence of formation change (change or no change), the stadium (home or away), the 
strength of teams, and the result of the previous match (win, draw, or loss) can affect 
a match result. As a linear combination of these factors, we defined the unobserved 
potential variable for team i, denoted by c^, by 


fI fi T ’i|j Hj T f3 w Wi + /AA T /?rU) (1) 

where fi — 1 if team i changed the formation, and fi — 0 otherwise; hi = 1 if the stadium 
was the home of team i, and h* = 0 otherwise; Wi — 1 if team i won the previous match, 
and Wi = 0 otherwise; ii = 1 if team i lost the previous match, and £ t — 0 otherwise; the 
strength of team i denoted by was defined as the fraction of matches that team i won 
in the given season. In Appendix D, we conducted the analysis by assuming that was a 
latent variable obeying the normal distribution and then using the hierarchical Bayesian 
model [33]. 

Consider a match between home team i and away team j. We assumed that the match 
result, denoted by k t j , was determined by the difference between the potential values of 
the two teams, i.e., 


Vij — • 


( 2 ) 


Variables y t j and k,^ are related by 


2 (home team wins) if c\ < y l3 + e^, 


hj — < 1 (draw) 


if c 0 < yij + 6ij < ci, 


(3) 


0 (home team loses) if y^ + e t j < cq, 




where Co and ci are threshold parameters, and is an error term that obeys the normal 
distribution with mean 0 and standard deviation 1. Because hi — hj — 1, (3^ appears as 
a constant term on the right-hand side of Eq. ([2]). In fact, it is impossible to estimate 
(3h because (3^ effectively shifts c 0 and C\ by the same amount such that there are only 
two degrees of freedom in the parameter space spanned by Co, ci, and (3h_. Therefore, we 
assumed Co = —c\ and estimated Co and f3 h. This assumption did not alter the estimates 
of the other parameters. Equation (|3]) results in 


P{kij = 2) = 1 - $(ci - yij), 

P{kij = 1) = $(ci -y^) - $(c 0 -y^), 
P{k i:j = 0) = $(c 0 -yij), 


(4) 

(5) 

( 6 ) 


where P denotes the probability, and $(•) is the cumulative standard normal distribution 
function. 

We excluded the matches that were the first game in a season at least for either 
team. We also excluded matches immediately after a change of manager in either team. 
Because the J-League data set did not contain the information on managers between 1993 
and 1998, we only used data between 1999 and 2014 in this analysis. We performed the 
analysis using R 3.1.2 [30j and maxLik package |_33|. 

2.5 Influence of individual manager’s behavior on match results 

Different managers may show WSLS behavior to different extents to respectively affect 
match results. Therefore, we analyzed data separately for individual managers. For each 
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manager i, we calculated the probability of winning under each of the following four 
conditions: (i) V s team won the previous match, and i changed the formation, (ii) i's 
team won the previous match, and i did not change the formation, (iii) i’s team lost 
the previous match, and i changed the formation, and (iv) i’s team lost the previous 
match, and i did not change the formation. We then compared the probability of winning 
between cases (i) and (ii), and between cases (iii) and (iv) using the paired t-test. In the 
t-test, we included the managers who directed at least ten pairs of consecutive matches in 
both of the two cases in comparison. In this and the next sections, we treated a manager 
as different data points when he directed different teams, as explained in section 2.1. In 
addition, we excluded the pairs of consecutive matches when the managers changed the 
team between the two matches. 

2.6 Degree of win-stay lose-shift 

To further examine possible relationships between manager’s behavior and match results, 
we looked at the relationships between the tendency of the WSLS behavior for each 
manager (degree of WSLS for short) and the probability of winning. The degree of WSLS 
is defined by 

degree of WSLS 

= | P (change | win) — Pwsls (change | win) | + \P (change) loss) — Pwsls (change | loss) | 

= | P (change | win) — 0| + |P(change|loss) — l| 

= P(change|win) + 1 — P(change|loss), (7) 

where Pwsls (change|win) (= 0) is the conditional probability that a perfect WSLS man¬ 
ager changes the formation after winning, and likewise for Pwsls (change | loss) ( = 1)- The 
degree of WSLS ranges from 0 to 2. 
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3 Results 


3.1 Win-stay lose-shift behavior in formation changes 

We examined the extent to which managers possibly changed the formation of the team 
after losing a match and persist to the current formation after a win. The results of 
the GLMM analysis with the results of the previous matches being the only independent 
variables are shown in Table [2j For both data sets, winning in a match significantly 
decreased the probability of formation change in the next match, and losing in a match 
increased the probability of formation change. The results did not essentially change when 
we used the full set of independent variables (Table |3]). Formation changes are consistent 
with WSLS patterns. 

For the J-League data, the effects of all the additional independent variables were 
insignificant. We analyzed the J-League data by regarding a pair of half seasons (i.e., 
an yearly season) as a season to confirm that the results remained qualitatively the same 
except that winning in the second last match also significantly decreased the probability 
of formation change (Appendix A). We also confirmed that matches played in the further 
past affect the probability of formation change to progressively small extents (Appendix 
E). 

For the Bundesliga data, winning in the second last match also significantly decreased 
the probability of formation change in the extended GLMM model (Table [3]). These re¬ 
sults are consistent with WSLS behavior. We also found for the Bundesliga data that 
stronger teams less frequently changed the formation and that a team would not change 
the formation to fight home games. We also investigated the Fussballdaten data for Bun- 
desliga, in which the definition of formation was different, and confirmed that managers 
tended to use the WSLS strategy (Appendix B). 


11 


3.2 Determinants of match results 


The results obtained from the ordered probit model are shown in Table SI For both data 
sets, formation changes did not significantly affect a match result. The result remained 
qualitatively the same when each pair of half seasons was considered as a season in the 
J-League data (Appendix A), and when the strength of team was assumed to be a latent 
variable in the ordered probit model (Appendix D). However, when the Fussballdaten data 
were used, formation changes significantly decreased the probability of winning (Appendix 
B). Table [4] also tells us the following. Trivially, stronger teams were more likely to win 
in both data sets. The home advantage was significant in both data sets, consistent 
with previous literature [T£1I55] . In Bundesliga, a win tended to yield a poor result in 
the next match. This is consistent with negative persistence effects reported in previous 
literature [18] , i.e., the results of the current and previous matches tend to be the opposite. 

Figure [Tfa) shows the probability of winning after individual managers changed or 
did not change the formation after a win in the J-League data. A large circle in Figure 
[4] represents a manager who presented both types of actions (i.e., formation change af¬ 
ter winning and no formation change after winning) at least ten times. A small circle 
represents a manager who presented either type of action less than ten times. The forma¬ 
tion change does not appear to affect the probability of winning. This is also apparently 
the case for the actions after a loss (Figure 01(b)) and the Bundesliga data (Figures 0](c) 
and |U(d)). The results also appear to be insensitive to the unconditional probability of 
winning, which roughly corresponds to the position along the diagonal in Figure 4. To 
be quantitative, we conducted the paired t-test on the managers who submitted the two 
types of actions at least ten times in each case (managers shown by the large circles in 
Figure 0J). For the J-League data, there was no significant effect of formation change 
on the probability of winning both after winning (p = 0.441, n = 3; corresponding to 
Figure 01(a)) and losing (p = 0.404, n = 4; Figure 01(b)). For the Bundesliga data, forma¬ 
tion changes after winning significantly decreased the probability of winning in the next 
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match (p = 0.026, n = 46; Figure 0](c)), whereas there was no significant effect after losing 
(p = 0.533, n = 42; Figure 0Jd)). These results suggest that formation changes did not 
at least increase the possibility of winning. 

The analysis with the ordered probit model aggregated the data from all managers. 
Therefore, we examined the relationship between the degree of WSLS and the probability 
of winning for individual managers. The results are shown in Figure [5] A circle in Figure 
[5]represents a manager. We did not find a significant relationship between the usage of the 
WSLS and the probability of winning for both J-League (Pearson’s r = 0.213, p = 0.411, 
n = 17) and Bundesliga (r = 0.058, p = 0.668, n = 58) data. 


4 Discussion 

We have provided evidence that football managers tend to stick to the current formation 
until the team loses, consistent with the WSLS strategy previously shown in laboratory 
experiments with social dilemma games pun] and gambling tasks PHH2]. Formation 
changes did not significantly affect (at least did not improve) a match result in most 
cases. This result seems to be odd because managers change formation to lead the team 
to a success. Generally speaking, when the environment in which an agent is located is 
fixed or exogenously changing, reinforcement learning usually improves the performance 
of the agent [4]. However, computational studies have suggested that it is not always 
the case when agents employing reinforcement learning are competing with each other, 
because the competing agents try to supersede each other [36b39j . The present finding that 
manager’s WSLS behavior does not improve team’s performance is consistent with these 
computational results. Empirical studies also suggest that humans obeying reinforcement 
learning does not improve the performance in complex environments. For example, players 
in the National Basketball Association were more likely to attempt 3 point shots after 
successful 3 point shots although their probability of success decreased for additional shots 
; iTil . Also in nonscientific accounts, it has been suggested that humans engaged in sports 
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and gambles often use the WSLS strategy even if outcome of games is determined merely 
at random [16] . We have provided quantitative evidence underlying these statements. 

Many sports fans possess the hot hand belief in match results, i.e., belief that a win or 
good performance persists [5T] . However, empirical evidence supports that streaks of wins 
and those of losses are less likely to occur than under the independence assumption [53]. 
By analyzing patterns of matches in the top division of football in England, Dobson 
and Goddard suggested the existence of negative persistence effects, i.e., a team with 
consecutive wins tended to perform poorly in the next match and vice versa |T8] . Their 
results are consistent with the present results; we observed the negative persistence effects, 
i.e., anticorrelation between the results of the previous and present matches. 

In the present study, we have neglected various factors that potentially affect the likeli¬ 
hood of formation change because our data sets did not contain the relevant information. 
For example, managers may change formations due to injuries, suspensions of players, 
and other strategic reasons including transfer of players. More detailed data will be able 
to provide further understanding of the relative importance of strategic versus accidental 
factors in formation changes. 

An important limitation of the present study is that we have oversimplified the con¬ 
cept of formation. Effective formations dynamically change during a match owing to 
movements of players. Because of the availability of data and our interests in the man¬ 
ager’s long-term behavior rather than formation changes during a match [25], we used 
the formation data released in the beginning of the matches. Based on recent technologi¬ 
cal developments, formations can be extracted from tracking data on movement patterns 
of players [42t.43]. Investigations on manager’s decision making using such technologies 
warrant further research. 
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Appendix A: Analysis of the J-League data on the 
basis of yearly seasons 


From 1993 to 2004, except for 1996, each season of J-League, spanning a year, was sub¬ 
divided into two half seasons. In the main text, we regarded each half season as a season. 
To examine the robustness of our results, we carried out the same analysis when we re¬ 
garded one entire season (i.e., one year), not one half season, as a season. The results were 
qualitatively the same as those shown in the main text (Tables EHI]) except that winning 
in the second last match significantly decreased the probability of formation change. 


Appendix B: Fussballdaten 

We analyzed data on Bundesliga from another website, Fussballdaten [28]. In the Fuss¬ 
balldaten data, each held player was assigned to one of the three positions (i.e., DF, MF, 
or FW) registered for an entire season. We defined the formation by counting the number 
of each type of held player in the same manner as that for the J-League data set. 

First, to examine possible existence of WSLS behavior by managers, we applied the 
GLMM analysis to the Fussballdaten data. The results shown in Tables [H] and E] are largely 
consistent with those for the Kicker online data (Tables 1 and 2). In particular, winning 
and losing in the previous match significantly decreased and increased the probability of 
formation change in the next match, respectively, consistent with WSLS behavior. 

Second, we also investigated the effect of formation change and other factors on the 
match result using the ordered probit model. Table [TO] indicates that formation changes 
have decreased the probability of winning. This result is not consistent with those for 
the two data sets shown in the main text. In addition, winning in the previous match 
decreased the probability of winning in the next match, indicating the presence of the 
negative persistence effect. This result is consistent with that for the Kicker online data 
(Table [T]). 
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Appendix C: Burstiness and memory coefficient of in¬ 
terevent time series 


To capture temporal properties of formation changes, in this section we calculated bursti¬ 
ness, 5, and memory coefficient, M, [29] on the basis of the interevent time series {r*} 
defined as follows. We calculated B and M for each manager. As in the main text, we 
treated a manager as different data points when he directed different teams. In the main 
text, we used interevent time series for individual seasons without concatenating different 
seasons. In this section, however, we use {rf} obtained by concatenating all seasons. For 
a given manager, we denote by t 0 , t\, .. ., tjv (2 < t 0 < t\ < ■ ■ ■ < tx) the times when 
the manager changes the formation. The number of formation change summed over all 
seasons is equal to N + 1. We counted the time in terms of the number of match rather 
than the day to exclude effects of variable intervals between consecutive matches in terms 
of the real time. If t? = 5, for example, the manager changed the formation to play the 
fifth match, and it was the third change for the manager since the first match in the data 
set. 

Managers sometimes moved from one team to another or did not direct any team. 
Because formation changes occurring as a result of a manager’s move or after a long 
absence were not considered to be strategic, we discarded the corresponding intervals. It 
should be noted that we did not mix interevent time series for a manager leading different 
teams. Then, a time series {tj} for each manager was partitioned into N k segments by 
either a manager’s move or absence. We denoted by W + 1 and K the total number of 
formation changes in the fcth segment and the total number of segments, respectively. It 
holds true that ^2 k=1 (N k + 1) = N + 1. We also denoted by (0 < i < N k ) the time of 
the ?'th formation change in the kth segment. The interevent time for formation changes 
was defined by r* = tf — t h -_ x (1 < i < N k ). It holds true that J^k=i T i = N +1 — K. 
In the following analysis, we used managers who directed at least 100 matches in a team 
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and has N + 1 — K > 10. 

The burstiness is defined by 


B = 


g jm — 1 a — m 
a/m + 1 a + m 1 


( 8 ) 


where m = Ef=i r t/( N + 1 — K) and a = E£( r f “ m) 2 /(N + 1 - K) 

are the mean and standard deviation of interevent time, respectively. B ranges between 
—1 and 1. A large value of B indicates that a sequence of formation change events is 
bursty in the sense that the interevent time obeys a long-tailed distribution. The Poisson 
process yields the exponential distribution and hence yields B = 0. 

The memory coefficient quantifies the correlation between two consecutive interevent 
times and is defined by 

K N k -1 

N-K 


M = 


_1_y- ( T i ~ m i)( r i+i ~ m 2 ) 

k =1 t=l 


(9) 


where 


g 1 = 


and 


cr 2 = 


K N k -1 

= EE 

k= 1 i— 1 

T k 

l 

(10) 

N-K ’ 

K N k 

= EE 

k= 1 t=2 

T k 

l 

(11) 

N-K 1 

K N k - 1 

EE- 

\ k= 1 i= 1 

(r k - mi) 2 

N-K ’ 

(12) 

K N k , 

yEE- 

\ k= 1 i=2 

rf - m 2 ) 2 

AT-A' 

(13) 


An uncorrelated sequence of interevent times yields M — 0. 

To examine the statistical significance of B value for each manager, we generated 10 3 
sequences of interevent times from the exponential distribution whose mean was equal 
to that of the original data. Each synthesized sequence had the same length (i.e., N) as 
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that of the original data. We calculated B for each synthesized sequence. We regarded 
that the value of B for the original data was significant if it was not included in the 95% 
confidential interval (Cl) on the basis of the distribution generated by the 10 3 sequences 
corresponding to the Poisson process. We calculated the Cl for M in the same manner 
except that we generated synthesized sequences by randomizing the original sequence of 
interevent times, instead of sampling sequences from the exponential distribution. 

Figure [6] shows histograms of burstiness, B, and memory coefficient, M, for the man¬ 
agers. The average values of B and M for interevent times of formation changes in the 
J-League data were equal to 0.145 and —0.137, respectively. Those for the Bundesliga 
data set were equal to 0.022 and —0.120, respectively. For both data sets, the average 
values of B were positive, and those of M were negative. The fraction of managers yield¬ 
ing significantly positive and negative B values were equal to 0.385 and 0, respectively, 
for the J-League data. Those for the Bundesliga data were equal to 0.372 and 0.244, re¬ 
spectively. These results suggest that in both data sets, a moderate fraction of managers 
changed formations in a bursty manner. In the Bundesliga data, however, some man¬ 
agers changed formations more regularly than expected from the Poisson process. The 
fractions of significantly positive and negative M values were equal to 0.077 and 0.077, 
respectively, for the J-League data. Those for the Bundesliga data were equal to 0.026 
and 0.026, respectively. In both data sets, the fractions of managers with significant M 
values were small, indicating that two consecutive interevent times were uncorrelated for 
a majority of managers. 


Appendix D: Hierarchical Bayesian model 

In the main text, we used the fraction of matches that a team won in a season to define 
the strength of the team. In this section, we analyze a model in which the strength of a 
team is assumed to be a latent variable. We used the hierarchical Bayesian ordered probit 
model combined with the Markov Chain Monte Carlo (MCMC) method [33]. The model 
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is the same as that used in the main text except for the derivation of the team strength. 
We assumed that the prior of the strength of team i in a season, denoted by r tl obeyed 
the normal distribution with mean 0 and variance a 2 . The priors of /5f, j3h, (3 W , and 
obeyed the normal distribution with mean 0 and variance 10 2 . The prior of a 2 obeyed the 
uniform distribution on [0, 10 4 ]. We conducted MCMC simulations for four independent 
chains starting from the same prior distributions. The total iterate per chain was set to 
25,000, and the first 5,000 iterates were discarded as transient. The thinning interval was 
set to 20 iterates. A final coefficient was regarded to be significant if the 95% credible 
interval did not bracket zero. We excluded the matches that were the first game in a 
season at least for either team. We performed the analysis using R 3.1.2 [3D] and RStan 
package [44] . 

Table [Til summarizes the results obtained from the Bayesian probit model. For both 
data sets, the credible interval of the coefficient representing the effect of the formation 
change brackets zero. Therefore, we conclude that formation changes have not affected 
the probability of winning. 


Appendix E: Cross-correlation analysis 


To further investigate possible relationships between formation changes and match results, 
we measured the cross-correlation between the two. In this analysis, we did not exclude 
the first match in each season. We used the teams that played at least 100 matches. We 
set fi t t = 1 if team i changes the formation in the fth match (2 < t < T*), where T % is the 
number of matches played by team i, and /,; )t = 0 otherwise; w^t = 1 if team i wins in the 
fth match, and Wj )t = 0 otherwise; A,t = 1 if team i loses in the fth match, and A,t = 0 
otherwise. We defined the cross-correlation between two time series {x ift } and {yij} by 


p(z,y,r) 


S iV T m E T, 2 f (x . t+f _ 


£^r m £ t V(^+f - x)y - y) 


~iTi~ f 




1 Ti—T / 


(14) 
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where x = (l/iV team ) xS^f m E^ 2 T x iit+f /(7],-f-l), y = (1/A^ team ) x 
f — 1), iVteam is the number of teams, and f is the lag. We measured the cross-correlation 
between formation changes and wins by 


p(f,w, t) 

if 

f > 0, 

(15) 

p{w,f,-f) 

if 

r < 0. 

(16) 


Replacing w by i in Eqs. ([13]) and (1T51) defines the cross-correlation between formation 
changes and losses. 

To examine the statistical significance of the cross-correlation obtained from the orig¬ 
inal data, we generated 10 3 randomized sequences of formation changes as follows. For 
given team i and positive lag f , we randomly shuffled the original sequence of formation 
changes, {/*, 2 +f, • • •, by assigning 1 (i.e., formation change) to each match with the 

equal probability such that the number of Is in the synthesized sequence was the equal 
to that in the original sequence. We generated a randomized sequence for each team. 
Then, we measured the cross-correlation between the randomized sequences of forma¬ 
tion changes and {wi j2 , ■ • •, f} or (4 2 , •.., 4?i—f} using Eq. (fl5|) . We repeated this 
procedure 10 3 times to obtain 10 3 cross-correlation values. The cross-correlation for the 
original data was considered to be significant for a given f if it was not included within 
the 95% Cl calculated on the basis of the 10 3 correlation coefficient values for the ran¬ 
domized samples. We also examined the statistical significance of the cross correlation for 
a negative lag on the basis of 10 3 cross-correlation values between randomized sequences 
of {/ i)2 , • • •, fi, Ti - f} and {wi' 2 +f, • • •, w iyTi } or {£ it2+f , • • •, 4tJ using Eq. ([ W>. 

The cross-correlation measured for various lags is shown in Figure [7] The cross¬ 
correlation value was the largest in the absolute value at f = 1. The effect of past 
matches on formation changes was mildly significant between r = 2 and t ~ 5. The 
sign of the effect (i.e., positive or negative) was the same for different lag values, which 
complies with the concept of WSLS. When t < 0, the cross-correlation was insignificant 
or weakly significant even if t ~ 0. This result is suggestive of a causal relationship, i.e., 
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a match result tends to cause a formation change. 
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Figures 


(a) J League, Team 



Win probability 


(c) Bundesliga, Team 



Win probability 


(b) J League, Manager 



Win probability 


(d) Bundesliga, Manager 



Win probability 


Figure 1: Distributions of the probability of winning, (a) Distribution for the teams in the 
J-League data, (b) Distribution for the managers in the J-League data, (c) Distribution 
for the teams in the Bundesliga data, (d) Distribution for the managers in the Bundesliga 
data. The colored bars correspond to the teams or managers that have played at least 
100 matches. 
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Figure 2: Definition of formation in the Bundesliga data. Kicker online gives the starting 
positions of the players as two-dimensional coordinates on the pitch. Field players with 
the identical distance from the goal line are aggregated into the same position. The 
starting positions shown in the figure are coded as 4-2-3-1. 
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Figure 3: Distributions of the probability of formation changes, (a) Distribution for the 
teams in the J-League data, (b) Distribution for the managers in the J-League data, (c) 
Distribution for the teams in the Bundesliga data, (d) Distribution for the managers in the 
Bundesliga data. In (a)-(d), the colored bars correspond to the teams or managers that 
have played at least 100 matches, (e) Probability of formation change in each season in 
J-League, aggregated over the different teams and managers. A circle represents a season. 
Because the J-League data set did not have the information on managers between 1993 
and 1998, we neglected changes of managers between 1993 and 1998. Between 1993 and 
2004, except for 1996, intervals between two circles are dense because a season consists 
of two half seasons, (f) Probability of formation change in each season in Bundesliga, 
aggregated over the different teams and managers. 
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Figure 4: Conditional probability of winning for individual managers after they changed 
or did not change the formation. A circle represents a manager associated with a team, 
who directed the team in at least 100 matches. A large circle represents a manager who 
showed both types of the actions (e.g., formation change after a win and no formation 
change after a win in (a) and (c)) at least ten times. A small circle represents a manager 
who presented either type of action less than ten times, (a) When the team won the 
previous match in the J-League data, (b) When the team lost the previous match in the 
J-League data, (c) When the team won the previous match in the Bundesliga data, (d) 

When the team lost the previous match in the Bundesliga data. 
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Figure 5: Relationship between the degree of WSLS and the probability of winning, (a) 
J-League. (b) Bundesliga. A circle represents a manager associated with a team, who 
directed the team in at least 100 matches. 
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(a) J League, B 



B 

(c) Bundesliga, B 
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(b) J League, M 
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(d) Bundesliga, M 
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Figure 6: Distributions of burstiness, B, and memory, M, across managers, (a) B for 
the J-League data, (b) M for the J-League data, (c) B for the Bundesliga data, (d) M 
for the Bundesliga data. The colored bars correspond to managers who have statistically 
significant values. 
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(a) 


J League, losses 


J League, wins (b) 



(c) Bundesliga, wins (d) Bundesliga, losses 



Figure 7: Cross-correlation between temporal patterns of formation changes and match 
results. Ranges between the dashed lines represent 95% CIs on the basis of the randomized 
sequences of formation changes. Cross-correlation between (a) formation changes and 
wins for the J-League data, (b) formation changes and losses for the J-League data, (c) 
formation changes and wins for the Bundesliga data, and (d) formation changes and losses 
for the Bundesliga data. 
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Tables 


Quantities 

J-League 

Bundesliga 

Year 

1993-2014 (1999-2014) 

1963-2014 

Number of seasons 

33 (22) 

51 

Number of matches 

5,944 (4318) 

15,548 

Number of teams 

29 (28) 

54 

Number of managers 

176 

372 

Number of wins (or losses) 

4,961 (3,435) 

11,543 

Number of draws 

983 (883) 

4,005 


Tabic 1: Statistics of the J-League and Bundesliga data sets. Values for the subset of the 
J-League data for which the information about the managers is available are shown in the 
parentheses. 
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Data set 

Variable 

Coefficient 

SE 

p-value 

J-League 

Win t _i 

-0.360 

0.101 

< 0.001 


Losst_i 

0.388 

0.093 

< 0.001 

Bundesliga 

Win t _i 

-0.179 

0.032 

< 0.001 


Loss t _i 

0.164 

0.032 

< 0.001 


Table 2: Results of the GLMM analysis when the results of the previous match were 
used as the sole independent variables. Win 4 _i denotes the binary variable representing 
whether the team has won the previous match (0: no win, 1: win). Likewise for Loss t _i 
(0: no loss, 1: loss). SE: standard error. 
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Data set 

Variable 

Coefficient 

SE 

p-value 

J-League 

Wiiq_i 

-0.299 

0.124 

0.016 


Wiip_2 

-0.204 

0.116 

0.079 


Loss t _i 

0.387 

0.120 

0.001 


Loss t _ 2 

0.146 

0.126 

0.248 


Wint_i x Win t _2 

0.005 

0.182 

0.979 


Losst_i x Loss t _2 

0.023 

0.164 

0.888 


Home 

-0.062 

0.072 

0.392 


Strength 

-0.192 

0.202 

0.343 

Bundesliga 

Wiiq_i 

-0.207 

0.040 

< 0.001 


Win t _ 2 

-0.117 

0.039 

0.003 


Losst_i 

0.136 

0.039 

< 0.001 


LoSSt_2 

0.007 

0.040 

0.867 


Win t _i x Win t _2 

0.025 

0.059 

0.676 


Loss t _i x Loss t _2 

0.108 

0.060 

0.072 


Home 

-0.118 

0.027 

< 0.001 


Strength 

-0.530 

0.082 

< 0.001 


Tabic 3: Results of the GLMM analysis when all the independent variables were consid¬ 
ered. Win t _j (i = 1,2) is the binary variable representing whether or not the team has 
won the (t — ?')th match (0: no win, 1: win). Likewise for Losst_j (i = 1, 2) (0: no loss, 1: 
loss). Home is equal to 0 for an away game and 1 for a home game. Strength is equal to 
the fraction of matches that the team has won in a season. SE: standard error. 
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Data set 

Variable 

Coefficient 

SE 

p-value 

J-League 

Formation change (Pi) 

0.112 

0.063 

0.075 


Home (/?h) 

0.087 

0.033 

0.009 


Win (p w ) 

0.082 

0.099 

0.405 


Loss (Pi) 

0.193 

0.100 

0.053 


Strength (/3 r ) 

2.889 

0.169 

< 0.001 

Bundesliga 

Formation change (Pi) 

-0.011 

0.016 

0.509 


Home (/3h) 

0.420 

0.011 

< 0.001 


Win (P w ) 

-0.060 

0.018 

0.001 


Loss (Pi) 

-0.006 

0.018 

0.738 


Strength (/3 r ) 

2.792 

0.060 

< 0.001 


Table 4: Effects of variables on match results as obtained from the ordered probit model. 
SE: standard error. 
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Variable Coefficient SE p-value 


Win*_i -0.352 0.101 <0.001 

Losst_i 0.398 0.093 < 0.001 

Table 5: Results of the GLMM analysis for the J-League data when a year was regarded 
as a season. The results of the previous match were used as the sole independent variables. 
Wirp_ i denotes the binary variable representing whether the team has won the previous 
match (0: no win, 1: win). Likewise for Losst_i (0: no loss, 1: loss). SE: standard error. 
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Variable 

Coefficient 

SE 

p-value 

Whp_i 

-0.306 

0.123 

0.013 

Wiiii_ 2 

-0.223 

0.111 

0.045 

Losst_i 

0.406 

0.119 

< 0.001 

Loss t _ 2 

0.145 

0.126 

0.249 

Win t _i x Win t _ 2 

0.021 

0.180 

0.909 

Loss t _i x LosSi _2 

0.003 

0.162 

0.985 

Home 

-0.062 

0.072 

0.388 

Strength 

-0.327 

0.219 

0.137 


Tabic 6: Results of the GLMM analysis with all independent variables for the J-League 
data when a year was regarded as a season. Win t _j {i = 1, 2) is the binary variable 
representing whether or not the team has won the (t — i)th match (0: no win, 1: win). 
Likewise for LosSi_j (i = 1,2) (0: no loss, 1: loss). Home is equal to 0 for an away game 
and 1 for a home game. Strength is equal to the fraction of the matches that the team 
has won in a season. SE: standard error. 
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Variable 

Coefficient 

SE 

p-value 

Formation change (/3f) 

0.053 

0.033 

0.109 

Home (/?h) 

0.134 

0.016 

< 0.001 

Win (/? w ) 

0.019 

0.034 

0.572 

Loss (/3 e ) 

0.079 

0.034 

0.020 

Strength (/3 r ) 

2.645 

0.093 

< 0.001 


Table 7: Effects of variables on match results for the J-League data when a year was 
regarded as a season. The ordered probit model was used. SE: standard error. 
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Variable Coefficient SE p-value 


Win*_i -0.482 0.030 < 0.001 

Loss t _i 0.407 0.031 < 0.001 

Table 8: Results of the GLMM analysis for the Fussballdaten data when the results of the 
previous match were used as the sole independent variables. Wup_i denotes the binary 
variable representing whether the team has won the previous match (0: no win, 1: win). 
Likewise for Lossj_i (0: no loss, 1: loss). SE: standard error. 
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Variable 

Coefficient 

SE 

p-value 

Win t _, 

-0.568 

0.038 

< 0.001 

Win t _ 2 

-0.234 

0.038 

< 0.001 

Loss t _i 

0.404 

0.038 

< 0.001 

t” 1 

o 

CZ3 

C/3 

1 

to 

0.018 

0.037 

0.628 

Wiip_i x Wiii (_2 

0.118 

0.056 

0.035 

LosSi_i x Loss t _2 

0.071 

0.058 

0.220 

Home 

-0.174 

0.026 

< 0.001 

Strength 

0.026 

0.078 

0.740 


Tabic 9: Results of the GLMM analysis for the Fussballdaten data when all the inde¬ 
pendent variables were considered. Win t _j (i = 1, 2) is the binary variable representing 
whether or not the team has won the (t — i)th match (0: no win, 1: win). Likewise for 
Losst_j (i = 1,2) (0: no loss, 1: loss). Home is equal to 0 for an away game and 1 for a 
home game. Strength is equal to the fraction of the matches that the team has won in a 
season. SE: standard error. 
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Variable 

Coefficient 

SE 

p-value 

Formation change (/3f) 

-0.034 

0.014 

0.019 

Home (/?h) 

0.420 

0.011 

< 0.001 

Win (/? w ) 

-0.066 

0.018 

< 0.001 

Loss (/3 e ) 

-0.003 

0.018 

0.858 

Strength (/3 r ) 

2.798 

0.060 

< 0.001 


Table 10: Effects of variables on match results for the Fussballdaten data. The ordered 
probit model was used. 
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Data set 

Variable 

Mean 

2.5% 

97.5% 

J-League 

Formation change (/%) 

0.004 

-0.081 

0.088 


Home (/?h) 

0.150 

0.113 

0.186 


Win (/3 W ) 

0.078 

0.003 

0.152 


Loss (Pi) 

0.068 

-0.007 

0.143 

Bundesliga 

Formation change (Pi) 

-0.029 

-0.062 

0.003 


Home (/?h) 

0.421 

0.401 

0.442 


Win (P w ) 

0.016 

-0.020 

0.051 


Loss (Pi) 

0.012 

-0.024 

0.047 


Table 11: Effects of variables on match results obtained from the hierarchical Bayesian 
ordered probit model. 
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